Automatic Prosody Quality Evaluation of Mandarin Speech
نویسندگان
چکیده
Prosody evaluation is an essential part of computer-aided language learning system. In the paper, we investigate an automatic prosody evaluation method for Mandarin speech. The method is based on prosody comparison between the tested and standard utterance. The prosodic similarities are calculated from three aspects: tone, intonation and rhythm. Based on these similarities, a ranking algorithm named SVOR is proposed and investigated to predict the prosody quality score. The algorithm is evaluated on the collected database and it shows better performance than other well-known algorithms. In addition, detailed analyses of human scoring at the sentence-level are given.
منابع مشابه
An automatic prosody labeling method for Mandarin speech
A new model-based automatic prosody labeling method for Mandarin speech is proposed. It first introduces four models to describe the relationships of the prosody tags to be labeled, the prosodic features of the speech signals, and the linguistic features of the associated texts. It then employs a sequential optimization procedure to estimate parameters of these four models and find all prosody ...
متن کاملProsody Variation: Application to Automatic Prosody Evaluation of Mandarin Speech
Prosody evaluation is an essential part of computer-aided language learning system. In the paper, prosodic variability among inter-speakers is investigated based on a database containing eight repetitions of 200 sentences. For Mandarin of reading style, its variability can be analyzed from rhythm, intonation and tone. Experimental results show that the mean correlation of tone between inter-spe...
متن کاملUsing prosody to improve Mandarin automatic speech recognition
In this paper, these problems of how to model and train Mandarin prosody dependent acoustic model and how to decode input speech based on prosody dependent speech recognition system will be discussed. We use automatic prosody labeling methods to annotate syllable prosodic break type and stress type on continuous speech corpus, and utilize our proposed methods to train prosody dependent tonal sy...
متن کاملLexical Tones Learning with Automatic Music Composition System Considering Prosody of Mandarin Chinese
Recent research has found that there is an overlap in the processing of music and speech in certain aspects. This research focuses on the relationship between the pitch of tones in language and the melody of songs. We present an automatic music composition system based on the prosody rules of Mandarin and we hypothesize that songs generated with our proposed system can help non-native Mandarin ...
متن کاملThe VUB Blizzard Challenge 2010 Entry: Towards Automatic Voice Building
In this paper we describe the voices we submitted to the 2010 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. One of the goals of a datadriven synthesizer, such as ours, is to generalize the speech database in such a way that it allows a realistic rendition of unseen input text. The two main changes to our system, compared to previous submissions, ar...
متن کامل